Greenburgh
Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input
Masis, Tessa, O'Connor, Brendan
Geo-entity linking is the task of linking a location mention to the real-world geographic location. In this paper we explore the challenging task of geo-entity linking for noisy, multilingual social media data. There are few open-source multilingual geo-entity linking tools available and existing ones are often rule-based, which break easily in social media settings, or LLM-based, which are too expensive for large-scale datasets. We present a method which represents real-world locations as averaged embeddings from labeled user-input location names and allows for selective prediction via an interpretable confidence score. We show that our approach improves geo-entity linking on a global and multilingual social media dataset, and discuss progress and problems with evaluating at different geographic granularities.
- North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
- Asia > Middle East > Republic of Türkiye > İzmir Province > İzmir (0.04)
- Europe > Russia > Central Federal District > Moscow Oblast > Moscow (0.04)
- (33 more...)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.48)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)